AIbase
Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
Vision-Text Generation

# Vision-Text Generation

Vit GPT2 Image Captioning
An image captioning model based on the ViT-GPT2 architecture, capable of generating natural language descriptions for input images.
Image-to-Text Transformers
V
motheecreator
149
0
Vit GPT2 Image Captioning
An image captioning model based on the ViT-GPT2 architecture, capable of generating natural language descriptions for input images.
Image-to-Text Transformers
V
mo-thecreator
17
0
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
English简体中文繁體中文にほんご
© 2025AIbase